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1  Introduction 


The  job  of  the  back-end  of  any  natural  language  interface  is  to  translate  a  logical  description  of  what  the  user 
wants  (a  request)  into  an  efficient  plan  for  fulfilling  that  request.  Typically  the  request  n  to  pr.iduce  data  from  some 
underlying  system:  that  is.  the  database,  applications  program,  or  other  system  with  which  the  user  is  communicating 
by  means  of  the  interface.  There  has  been  a  fair  amount  of  work  on  the  problem  of  natural  language  interfaces  to 
single  underly  ing  systems,  for  example.  [51. 

As  computer  systems  become  more  complex,  there  is  more  opportunity  for  combining  the  strengths  of  more 
than  one  system  in  order  to  perform  a  task  For  example,  one  might  imagine  combining  several  resources  a 
database  for  stoiuig  relational  information  with  an  applications  program  to  perform  calculations  based  on  that 
information,  an  expert  system  to  perform  inferences,  and  a  display  system  to  present  data  in  a  useful  way.  In  such 
an  environment  a  "seamless"  natural  language  interface  can  become  a  very  effective  tool,  allowing  the  user  to 
retrieve  and  manipulate  information  w  ithout  needing  to  pay  attention  to  the  details  of  any  particular  resource. 

The  back-end  of  such  an  interface,  however,  is  necessarily  more  complex:  not  only  must  it  be  able  to  translate 
the  user’s  request  into  executable  code,  but  it  must  also  be  capable  of  organizing  the  various  resources  at  its 
disposal,  choosing  which  combination  of  resources  to  use.  and  supervising  the  transfer  of  data  among  them.  We  call 
this  the  multiple  underlying  systems  (MUS)  problem.  This  document  describes  one  approach  to  the  \fL'S  problem,  a 
MO'S  component  implemented  as  part  of  the  back  end  of  the  Janus  natural  language  interface 

We  begin  in  section  2  with  a  brief  description  of  Janus'  npe  system,  a  component  of  the  semantic 
interpretation  language  (WML.  for  World  Model  Language)  that  plays  an  important  role  m  both  front-end  and 
back-end  processing.  Section  3  describes  the  translation  of  WML  expressions  into  a  simplified,  normalized  form 
Section  4  discusses  the  way  that  the  sy  stem  represents  the  capabilities  of  underlying  systems.  Section  5  describes 
the  algorithm  used  lb.'  finding  an  effective  combination  of  the  services  provided  by  the  underlying  systems,  in  order 
to  satisfy  a  given  request.  Section  6  describes  how  tins  combination  of  resources  is  used  to  produce  an  execution 
plan,  and  section  7  deals  with  the  execution  of  this  plan  and  the  transfers  of  data  this  often  entails.  Finally  .  section  8 
discusses  the  implementation  of  this  approach,  as  well  as  limitations  and  extensions  to  this  work. 


2  The  I  t  pe  System 


The  syntax  of  WML  is  "modelled  after  languages  of  the  typed  lambda  calculus"  (  [2).  p  27*.  The  import  of 
this  is  perhaps  most  concisely  expressed  in  [6[: 

Each  type  expression  [or  ripe)  is  associated  with  a  set  of  entities  or  structures  which  is  termed  its  domain.  Every 
expression  of  the  language ...  can  be  mapped  to  a  ty  pe  expression,  whose  domain  serves  to  delimit  the  range  of  vaJue^ 
the  expression  can  take  on 

A  complete  description  of  the  type  system  associated  with  WML  is  beyond  the  scope  of  this  document,  but  in  this 
section  we  attempt  to  convey  a  sense  of  it  by  means  of  examples 
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The  set  of  types  has  two  major  subsets  --  those  that  are  independent  of  the  domain,  and  those  that  are  specific 
to  the  domain  The  former  set  includes  the  types  TV  (truth- value).  INTEGERS.  STRINGS.  REALS.  TIMES,  and 
WORLDS;  the  latter  consists  of  types  constructed  from  concepts  and  roles  in  the  domain-model  For  example,  if 
VESSEL  and  LOCATION  are  domain-model  concepts,  and  SHIP-LOCATION  and  COMBAT-READY  are  domain 

Denotation 


all  vessels 
all  sets  of  vessels 

all  ordered  pairs  of  vessel,  location 
all  sets  of  such  ordered  pairs 
functions  from  vessels  to  locations 
unary  predicates  on  vessels 

Here  we  present  some  types  together  with  examples  of  logical  subexpressions  has  mg  those  types  <  that  is. 
(TYP EOF  expression)  =  rypei. 

1.  type  VESSEL 

Vincennes 

(IOTA  ?JX1  VESSEL  (COMBAT-READY  ?JX1)) 

2.  type  (S  VESSEL) 

(SETOF  Vincennes  Kennedy) 

(POWER  VESSEL) 

(SET  ?  JX2  VESSEL  (COMBAT-READY  ?JX2) ) 

(IOTA  ?JX1  (POWER  VESSEL)  (COMBAT-READY  ?JXl)) 

3.  type  (FUNC-TYPE  VESSEL  LOCATION) 

SHIP -LOCATION 

(LAMBDA  (? JX3)  VESSEL  (VESSEL-LOCATION  ?JX3) ) 

4.  type  (FUNC-TYPE  VESSEL  TV) 

COMBAT -READY 
(LAMBDA  (?JX4)  VESSEL 

(AND  (EQUAL  (VESSEL-LOCATION  ?JX4)  "HAWAII") 

(COMBAT-READY  ?JX4))) 


3  Normalizing  WML  Expressions 


roles,  then  some  related  types  might  include; 

?YPe 

VESSEL 
(S  VESSEL) 

(TUPLE  VESSEL  LOCATION) 

(S  (TUPLE  VESSEL  LOCATION) ) 
(FUNC-TYPE  VESSEL  LOCATION) 
(FUNC-TYPE  VESSEL  TV) 


WML  input  expressions  are  simplified  and  normalized  before  they  are  further  processed  by  the  multiple 
underlying  systems  (ML'S)  component.  This  simplification  process  has  five  stages 

1.  the  extensionalization  of  intensional  subexpressions. 

2.  the  translation  of  the  entire  expression  into  a  modified  disjunctive  normal  fomi, 

3.  the  elimination  of  unnecessary  equivalences. 
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4.  tte  applicaoon  of  underlying-system-independent  rewrites,  and 

5.  the  use  of  printfunction.s  for  improving  responses 

These  stages  occur  sequentially,  and  —  as  the  system  is  currently  implemented  --  must  be  done  in  the  order  presented 
here.  In  this  section,  we  discuss  each  of  these  stages  in  some  detail 

3.1  k\tensionuli/ino  intensional  Subexpressions 

The  first  stage  in  normalizing  a  WML  expression  is  the  extensionalization  of  the  logical  form's  intension.il 
subexpressions.  Most  underlying  systems,  whether  they  are  databases,  expert  systems,  or  other  applications 
programs,  are  extension.il  Those  that  do  take  time  and/or  possible  worlds  into  account  tend  to  do  it  in  a  very 
discrete  fashion  For  example,  the  Navy  's  Integrated  Data  Base  ( I DB )  divides  the  temporal  continuum  for  ships 
combat-readiness  ratings  into  previous-readiness,  current -readiness,  and  projected-readmess:  a  BBN  object-oriented 
simulator  supports  hypothetical  worlds,  hut  maintains  discrete  world-states  in  a  tree-like  hierarchy . 

This  discreteness  makes  it  possible  to  "extensionali/e"  many  intensional  subexpressions  (ideally  all  of  them' » 
in  the  following  manner  given  an  intensional  context  ( i.e. .  an  intension  together  with  its  time  and  world  indices), 
every  predicate  within  the  intensional  expression  "absorbs''  the  tinie'world  information,  either  <  1  i  by  replacing  it 
with  a  related  but  time-  (or  world-)  specific  predicate,  or  (2)  by  adding  temporal, world-related  information  to  the 
predicate's  argument  list. 

For  example,  the  expression 

((INTENSION  (SHIP -LOCATION  ?JX1  ?JX2) )  13-JULY-1965  DEFAULT-WORLD) 

might,  according  to  the  first  of  these  med.ods,  be  extensionalized  as 

(PAST-SHIP-LOCATION  ?JX1  ?JX2) 

assuming  that  PAST-SMPLOCATh >\  is  a  domain  model  predicate  and  that  there  is  appropriate  machinery  for 
choosing  it  on  the  basis  of  the  time  index  This  approach  loses  information,  of  course  --  such  a  process  of 
extensionalization  could  not  lake  place  unless  it  is  certain  that  no  underlying  system  would  be  interested  in  the 
specific  time  index,  that  is.  that  the  new  predicate  {PASI-SHIPLOCA TIGS')  is  sufficient  for  the  purposes  of  any 
underlying  system. 

Alternatively,  the  indexical  information  could  he  absorbed  into  the  predicate  argument  structure,  as  in 

(SHIP-LOCATION-AT-TIMZ  ?JX1  ?JX2  13- JULY-1965) . 

This  approach  preserves  the  information:  underlying  systems  are  free  to  use  the  time  argument,  to  infer  from  it  a 
predicate  like  PAST -SHIP -LOC  ATIOSI .  or  to  ignore  it  altogether 

At  the  time  of  this  writing,  the  implementation  of  extensionalization  is  incomplete,  and  for  the  moment 
assumes  that  no  predicate  cares  about  time  or  world  indices.  Thus  the  intensional  expression  would  simply  be 
extensionalired  as 

(SHIP-LOCATION  ?JX1  ?JX2) . 
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The  process  \iensionulization  as  characterized  here,  applies  to  more  complex  intensions,  as  well  one 
may  do  a  recursive  walk  through  an  intension,  rewriting  predicates  las  above,  for  example!  on  the  basis  of  the  time 
and  world  indices. 


3.2  Disjunctive  Normal  Form 


The  second  stage  of  normalization  is  the  translation  of  the  extensionalized  WML  expression  into  a  somewhat 
simplified  logical  expression  ui  a  modified  disjunctive  normal  form  iDNT). 


The  expression  is  translated  mto  a  disjunctive  normal  form  for  two  main  reasoas.  We  normalize  the 
expression  (reducing  the  number  of  embedded  subexpressions,  for  e.xamplei  in  order  to  simplify  the  process  ol 
matching  various  pieces  of  it  to  underlying  system  capabilities  We  choose  to  use  a  disjunctive  normal  form 
because: 

•  In  the  simplest  case,  an  expression  in  disjunctive  normal  form  is  simply  a  conjunction  of  clauses,  a 
particularly  easy  logical  form  to  cope  w  ith. 

•  Even  when  there  are  disjuncts.  each  can  be  individually  handled  as  a  conjunction  of  clauses,  and  the 
results  then  combined  together  v  ia  union,  and 

•  Bnnging  disjunctions  to  the  top  level  allows  patterns  to  match  in  many  cases  where  it  would  otherw  ise 
not  be  possible.  For  example,  given  the  tnon-normaiized)  pattern 

(AND  (OR  (IN. CLASS  ?JX1  SUBMARINE) 

(IN. CLASS  ?JX1  AIRCRAFT) ) 

(LENGTH  ?JX1  ? JX2) ) 

a  service  seeking  to  match  the  pattern 

(AND  (IN. CLASS  <x>  SUBMARINE) 

(LENGTH  <x>  <y>) ) 

could  not  match  The  DNF.  on  the  other  hand. 

(OR  (AND  (IN. CLASS  ?JX1  SUBMARINE) 

(LENGTH  ?JX1  ?JX2) ) 

(AND  (IN. CLASS  ?JX1  AIRCRAFT) 

(LENGTH  ?JX1  ?JX2)  ) J 

allows  the  match  to  take  place,  by  keeping  the  relevant  information  together  In  a  disjunctive  normal 
form,  each  disjunct  effectively  carries  all  the  information  necessary  for  a  distinct  .subquery 

A  standard  disjunctive  normal  form  is  a  disjunction  of  conjunctions  of  predicates  or  negated  predicates  no 
variables  in  such  an  expression  are  explicitly  quantified,  and  all  are  assumed  to  be  implicitly  universally  quantified 
Existentially  quantified  vanables  have  been  replaced  by  skolem  terms  ."'noting  some  individual  instantiation  of  the 
van  able 

The  modified  disjunctive  normal  form  differs  from  a  standard  DNF  in  several  respects: 

•  There  is  a  response  clause  for  every  query :  that  is.  an  additional  predicate  whose  arguments  are  the 
vanables  for  w  hich  we  want  returned  values.  In  a  query  requesting  a  set  of  objects  ie  g  "Which  ships 
are  in  the  Indian  Ocean’")  the  argument  in  the  response  clause  will  be  the  vanable  denoting  the  set  in 
question  The  same  is  true  for  queries  requesting  individuals  (e  g.  "the  ship  whose  speed  is  30  knots  ); 
the  resulting  logical  form  will  seek  all  possible  individuals  that  meet  the  same  description 
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In  a  yes/no  or  existential  query.  the  response  clause  vx  ill  contain  all  variables  in  the  query  ,  since  any 
instantiation  of  all  the  query  variables  means  an  affirmative  answer:  the  inability  to  find  am  su^h 
instantiation  means  an  answer  in  the  negative.  In  such  cases,  a  particular  variation  on  the  response 
predicate  is  used  rather  than  using  the  special  predicate  RESPONSE.  we  use  the  special  predicate 
V  AL i  E -EX  IS  IS-RfSPi  l\S /  instead,  this  preserves  the  information  that  the  query  s  intent  is  to  find  out 
if  values  exist,  rather  than  to  have  them  returned  1 

•  .411  functional  terms  must  appear  as  predicates:  if  P  is  a  binary  predicate  and  Q  is  a  unary  function,  then 
Pix.  Qiyn  must  appear  as  Ptx.  :i  and  Q'<\.  :t.  where  Q'ty,  zi  is  true  iff  Q(y)=z  For  example,  the 
clause 

(GREATER-THAN  (SPEED-OF  ?JX1)  30) 

will  appear  as 

([AND]  (SPEED-OF  ?JX1  ?JX2) 

(GREATER-THAN  ?JX2  30) ) 

•  Similarly,  provision  is  made  for  complex  terms  like  database  aggregates:  for  example,  cardinality, 
average,  and  sum  Such  complex  terms  may  only  appear  as  the  first  argument  to  a  special  predicate 
called  /S-TERM.  the  second  argument  is  always  a  variable  that  represents  the  term  For  example,  if  the 
logical  expression  asks  for  the  cardinality  of  the  ships  in  the  Indian  Ocean,  we  would  use  the  following 
clause 

(IS-TERM  #S (CONTEXT 

: OPERATOR  CARDINALITY 
: OP  E  RATOR - VAR  ?JX2 
.CLASS-EXP 

((IN. CLASS  ? JX2  SHIP) 

(SHIP-LOCATION  ?JX2  "INDIAN  OCEAN"))) 

?JX1) 

(RESPONSE  ?JX1) 

•  There  is  no  implicit  assumption  of  universal  quantification  for  unquantified  variables  --  expressions  in 
the  modified  DNF  may  contain  universal  quantification. 

•  Existential  quantifiers  are  removed,  not  by  replacing  existentially  quantified  variables  with  skoleni 
terms,  but  simply  bv  removing  the  explicit  existential  quantification.  The  resulting  unquantified 
variables,  along  with  all  other  unquanutied  variables  in  the  form,  are  considered  to  be  query  quantified 

The  term  qua\  quantified  refers  to  variables  for  which  we  would  like  to  get  all  possible  instantiations 
ouch  variables  an*  neither  existentially  quantified  (since  we  re  interested  in  ali  instantiations)  nor 
universally  quantified  'Since  universal  quantification  has  no  notion  of  returning  values i.  this  kind  of 
quantification  is  more  like  that  of  the  vanables  in  a  PROLOG  expression 

Notice  that,  because  universal  quantifications  are  not  removed,  there  is  no  need  to  skolemi/e 
existentially  quantified  vanables  appeanng  within  the  scope  of  universal  quantifiers. 

The  following  is  a  specification  of  the  modified  disjunctive  normal  form  Square  brackets  ([ji  indicate 
optional  elements  Contexts  are  objects  with  internal  components  (implemented  as  LISP  structures)  --  these  objects 
represent  distinct  logical  environments  whose  internal  components  must  be  kept  separate  from  the  remainder  of  the 
expression 

expression  ([AND]  clause4  [response-clause]) 


'A  voopcrjti'.f  sssicrn  mj\  still  return  the  *  jlues 


it  lhr\  exist.  f«*r  example  Are  an\  ships  Cl  might  lejii  to  ihc  response  Yes 


the  Cl  ships 
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clause  (predicate  arg*)  | 

context-clause  | 

is-term-clause  | 

in-class-clause 

response-clause  :-  ("ESPONSE  var*)  |  (VALUE-EXISTS-RESPONSE  var*) 

context-clause2  :-  disjunction-context  | 

negation-context  | 
quantifier -context 

disjunction-context  :-  #S (CONTEXT 

: OPERATOR  OR 

: CLASS-EXP  (clause4) 

[.FREE-VARS  (var+) ] 

[ : LOCAL-VARS  (var+) ]  ) 


negation-context  :-  #S (CONTEXT 

: OPERATOR  NOT 

: CLASS-EXP  clause 

[  ■.FREE-VARS  (var*)  ] 

[: LOCAL-VARS  (var*) ]  ) 

quantifier-context  :-  #S (CONTEXT 

: OPERATOR  FORALL 

: OPERATOR-VAR  var 

•.CLASS-EXP  expression 

.•CONSTRAINT  expression 

[: FREE-VARS  (var*) ] 

[: LOCAL-VARS  (var*)]  ) 

is-term-clause  :-  (IS-TERJM  term-context  var) 

term-context  : -  #S (CONTEXT 

: OPERATOR  term-context -operator 

: OPERATOR-VAR  var 

:CLASS-EXP  expression 

: CONSTRAINT  expression 

: STAT-VAR3  var 

[: FREE-VARS  (var*) ] 

[: LOCAL-VARS  (var*)]  ) 

in-class-clause  (IN. CLASS  var  simple-type) 

arg  :-  var  (  constant 

var  ?JX1  |  ?JX2  j  ?JX3  |  ... 


;Notise  that  the  sonrents  of  a  context  object  depends  upon  the  operate-  for  example,  in  a  quantifier  context  teg  H>RAI.L'  the 
class-expression  field  ulass-expt  is  an  expression,  svhereas  sxiihin  a  negation  eontext.  that  Ticld  contains  a  clause 

’This  field  is  not  currenth  used  hut  is  intended  to  p*-  >\  ide  a  place  to  store  in  additional  sanahle,  it  (Ik-  ss  ntax  of  the  operator  require-  ■’ 
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term- context -ope rat or 
simple-type  :  - 

predicate 
constant 


CARDINALITY  |  AVG  |  SUM  |  ... 

Jomain-nuiJt  !-cuth  ept-nunw  |  r\pt-s\slem-^.\:\u  -,,p  t 
donuan-mcdd-ndv-name  |  npe-s\slem-juni  -t\pt’ 
npf-SMUrn-unliuduii!  |  string  \  number 


3.3  Kliminalin^  I  titmalencex 


Occasionally,  a  logical  expression  w  ill  include  unnecessary  equivalences  between  terms  --  equating  a  constant 
with  a  variable,  for  example,  and  then  using  the  variable  elsewhere  in  the  expression  where  the  constant  would  do 
just  as  well  It  is  helpful  to  eliminate  such  equivalences  early  in  processing. 


Consider  the  following  expression,  resulting  from  the  query  "What  :ve  the  readinesses  of  the  cruisers  that  are 
not  Cl?": 

(AND  (EQUAL  '’JX109  7JX110) 

(IN. CLASS  7JX111  CRUISER) 

(VESSEL-OVERALL-READINESS-OF  7JX111  7JX110) 

#S (CONTEXT  : OPERATOR  NOT 

: FREE-VARS  (7JX110) 

■•LOCAL- VARS  NIL 
: OPERATOR -VAR  NIL 
: CLASS-EXP  (EQUAL  7JX110  Cl) 

: STAT-VAR  NIL 
.-CONSTRAINTS  NIL) 

(IN. CLASS  ? JX110  READINESS -RATING) 

(RESPONSE  ? JX109) ) 


Here  the  variables  ’JX109  and  7JX 110  are  equated,  the  elimination  of  that  equivalence  simplifies  the 
expression  without  changing  its  meaning  The  resulting  expression  is 

(AND  (IN. CLASS  7JX111  CRUISER) 

(VESSEL-OVERALL-READINESS-OF  7JX111  7JX207) 

#S (CONTEXT  : OPERATOR  NOT 

: FREE-VARS  (7JX207) 

: LOCAL -VARS  NIL 

: OPERATOR- VAR  NIL 

: CLASS -EXP  (EQUAL  7JX207  Cl) 

: STAT-VAR  NIL 
: CONSTRAINTS  NIL) 

(IN. CLASS  7JX207  READINESS-RATING) 

(RESPONSE  7JX207) ) 

Notice  that  a  new  \anable.  ’JX207.  has  replaced  the  equivalence  class  |?JX109.  ’JX1 10). 

Not  all  statements  of  equably  are  unnecessary  ;  for  example,  the  query  Are  there  (exactly  i  three  Cl  cruisers  ' " 
results  in  the  following  expression: 
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(AND  (VALUE-EXISTS-RESPONSE  7JX210) 

(IN. CLASS  7JX210  CRUISER) 

(IS-TERM 

#S  (CONTEXT  CARDINALITY 

: FREE-VARS  NIL 
: LOCAL -VARS  NIL 
: OPERATOR- VAR  ?JX210 
: CLASS-EXP  NIL 
:  STAT-VAR  NIL 
: CONSTRAINTS  NIL) 

7JX219) 

(EQUAL  7JX21 9  3) 

(VESSEL-OVERALL-READINESS-OE  7JX210  Cl)) 

Here  the  clause  i  EQL  .\L  r‘J\2IQ  Ji  ts  not  an  equivalence  that  can  he  eliminated 


3.4  System-independent  Rewrites 


At  this  stage  of  the  normalization  process,  the  system  permits  the  application  of  obligatory  rewrite  rules. 
These  rules  must  be  independent  of  the  underly  ing  systems  both  pattern  and  result  must  consist  of  donhiin-niixjcl 
information,  and  they  may  not  contain  any  references  to  structures  or  data  in  the  underlying  system! s). 


Rewrite  patterns  may  seek  to  match  both  simple  clauses  (i  e  .  those  that  are  not  contexts;,  and  context-clauses. 
Similarly,  results  may  be  either  contexts  or  simple  clauses.  For  example,  the  following  rewrite  might  be  used  if  it 
was  know  n  that  the  number  of  subordinates  of  a  manager  corresponded  to  the  number  of  employees  in  a  manager's 
department: 

(define -simple -rewrite4 

:pattern  ((in. class  x  manager) 

( ; context 

; operator  CARDINALITY 
:class-exp  ((in. class  y  person) 

(subordinate-to  y  x) ) ) ) 

•.result  ((in. class  x  manager) 

(department-of  x  z) 

(employee -count  z  y) )  ) 

Ry  using  this  rewrite  rede,  we  transform  a  query  in  which  one  actually  counts  elements  in  a  set  (via  the 
cardinality  tenni  into  one  in  which  a  single  table  lookup  is  used  instead 


*In  simple  femmes  v  \.  /.  ami  *  ire  alw.ivs  variables 
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3.5  Print  functions 

Often  the  logical  content  of  a  query  does  not  reflect  its  desired  interpretation  For  example,  a  query  ax  simple 
as  Last  the  cruisers."  it  interpreted  literally,  produces  a  listing  of  the  databases  internal  representation  for  each 
cruiser  In  the  Navy's  IDB  domain,  this  representation  is  a  number  called  an  HID  -  a  number  that  is  almost  certain 
to  be  completely  useless  to  the  user  as  a  means  of  ship  identification.  What  one  would  rea!!\  like  is  for  the  sy  stem 
to  be  smart  enough  to  interpret  the  question  as  "List  the  names  of  the  cruisers  "  Pnntjuncitons  provide  just  that 
functionality . 


The  pnntfunction  machinery  is  quite  simple.  With  certain  classes  of  objects  (e  g.,  the  domain-model  concept 
\  ESSEL)  one  associates  a  specification  for  how  members  of  that  class  should  be  presented  to  the  user,  called  a 
pnntfunctions  list.  Each  element  of  the  pnntfunctions  list  (each  pnntfunction  >  is  either  1 1 1  the  name  of  a  domain- 
model  role,  or  ( 2  >  the  special  symbol  :IDENT1TV.  As  a  postprocessing  step  of  the  normalization,  the  variables  on 
the  response  List  are  examined,  and  a  new  response  list  created  as  follow  s: 

Let  the  new  response  list  begin  as  an  empty  list 
Tor  each  variable  i  on  the  original  response  list 
Let  tspe  be  the  variable's  type 

If  rape  has  no  printfunction  list  associated  with  it 
Add  i  to  the  new  response  list 
Else 

Tor  each  element  pfn  in  the  type' s  printfunction  list 
If  pfn  is  :  IDENTITY 

Add  v  to  the  new  response  list 
Else  pfn  is  a  domain  model  role: 

Let  new  be  a  new  variable 

Add  the  clause  (pfn  \  new i  to  the  query  itself 
Add  the  variable  new  to  the  new  response  list 


Pnntfunction;,  arc  inherited  --  in  the  examples  in  the  following  section,  responses  involving  ship  classes  like 
cruiser  and  aireraft-camer  are  always  expressed  as  responses  involving  the  names  of  the  ships  because  the  class 
l  ESSEL  (the  top-level  class  for  shipsi  has  the  pnntfunction  list  tSAMEOFi  associated  with  it 


3.6  Examples  of  Normal  Form 

the  examples  in  this  section  consist  of  a  WML  expression,  followed  by  its  DNF.  and  then  followed  by  the 
normalized  form  after  rewrites  and  pnntfunctions  have  applied  (if  there  was  any  change ). 

I.  "List  the  ships." 
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(BRING-ABOUT 
(  (INTENSION 

(EXISTS  ?JX1  LIST 

(OBJECT. OF  7JX1  (IOTA  7JX2  (POWER  VESSEL)  T) ) ) ) 
TIME  WORLD) ) 


Normalized  expression: 

(AND  (IN. CLASS  ?JX1  LIST) 

(IN. CLASS  ?JX2  VESSEL) 
(OBJECT. OF  ?JX1  ?JX2) 

(MEMBER  ?JX1  ? JX3) 

(IN. CLASS  ? JX3  (POWER  EVENT)) 
(RESPONSE  ?JX3) ) 


Expression  after  rewrites  and  printfunctions  have  applied: 

((RESPONSE  ?JX70) 

(NAMEOF  ? JX2  7JX70) 

(IN. CLASS  ? JX2  VESSEL)) 

2.  "Which  ships  are  Cl?" 

(QUERY 

( (INTENSION 
(PRESENT 

(INTENSION 

(IOTA  7JX4  (POWER  VESSEL) 

(VESSEL-OVERALL-READINESS-OF  7JX4  Cl))))) 

TIME  WORLD) ) 

(AND  (IN. CLASS  ?JX4  VESSEL) 

(VESSEL-OVERALL-READINESS-OF  ?JX4  Cl) 

(RESPONSE  ?JX4) ) 

((RESPONSE  7JX72) 

(NAMEOF  7 JX4  7JX72) 

(IN. CLASS  7JX4  VESSEL) 

(VESSEL-OVERALL-READINESS-OF  7JX4  Cl)) 

3.  "Which  cruisers  are  not  Cl 1M 
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(QUERY 

( (INTENSION 
(PRESENT 

(INTENSION 

(iota  ? jx8  (pcwdf.  cruiser; 

(NOT  (VESSEL -OVERALL. -READINESS  -OF  ?JX8  Cl)))))) 
TIME  WORLD) ) 

(AND 

(IN. CLASS  ?JX8  CRUISER) 

#S  (CONTEXT 

: OPERATOR  NOT 
: FREE-VARS  (?JX8) 

: LOCAL -VARS  NIL 
: OPERA TOR-VAR  NIL 

: CLASS -EXP  (VESSEL-OVERALL-READINESS-OF  ?JX8  Cl) 
: STAT-VAR  NIL 
: CONSTRAINTS  NIL) 

(RESPONSE  ?JX8) ) 

((RESPONSE  ? JX73) 

(NAMEOF  ? JX8  ? JX73) 

(IN. CLASS  ? JX8  CRUIi. 2R) 

#S (CONTEXT 

: OPERATOR  NOT 
: FREE-VARS  (?JX8) 

: LOCAL-VARS  NIL 
.•OPERATOR- VAR  NIL 

: CLASS -EXP  (VESSEL-OVERALL-READINESS-OF  ?JX8  Cl) 

: STAT-VAR  NIL 
: CONSTRAINTS  NIL)) 

4.  "Are  any  earners  harpoon  capable?" 

(QUERY 

( (INTENSION 
(PRESENT 

(INTENSION 

(EXISTS  ? JX20  (POWER  AIRCRAFT-CARRIER) 
(HARPOON-CAPABLE-VESSEL  ?JX20))))) 

TIME  WORLD) ) 

(AND  (VALUE-EXISTS -RESPONSE  7JX20) 

(IN. CLASS  ?JX20  AIRCRAFT-CARRIER) 
(HARPOON-CAPABLE-VESSEL  ? JX20) ) 

( (VALUE-EXISTS-RESPONSE  ?JX74) 

(NAMEOr  ? JX20  ?JX74) 

(IN. CLASS  ? JX20  AIRCRAFT-CARRIER) 
(HARPOON-CAPABLE-VESSEL  ?JX20) ) 


5.  "Are  the  cruisers  and  the  carriers  c  1  ?" 
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(QUERY 

( (INTENSION 
(PRESENT 

(INTENSION 

(VESSEL-OVERALL-READ INESS-OF 
(SETOF 

(IOTA  ? JX56 
(POWER 

(SET-TO-PRED 

(IOTA  ? JX59  (POWER  CRUISER)  T)  )  ) 

T) 

(IOTA  ?  JX57  (POWER  AIRCRAFT-CARRIER)  T) ) 

Cl))) ) 

TIME  WORLD) ) 

(#S (CONTEXT 

•.OPERATOR  OR 
: FREE-VARS  (?JX64) 

: LOCAL -VARS  (?JX56  ?JX59  ?JX57) 

.•OPERATOR- VAR  NIL 
.•CLASS -EXP 

((AND  (EQ  ?  JX64  ?JX56) 

(IN. CLASS  ? JX5 6  CRUISER)) 

(AND  (EQ  ?  JX64  ?JX57) 

(IN. CLASS  ? JX57  AIRCRAFT-CARRIER))) 
.•STAT-VAR  NIL 
.•CONSTRAINTS  NIL) 

(VESSEL-OVERALL-READINESS-OF  ? JX64  Cl) 
(VALUE-EXISTS-RESPONSE  ?JX64) ) 

6.  How  many  cruisers  are  in  the  Indian  Ocean'.’" 

(QUERY 

( (INTENSION 
(PRESENT 

(INTENSION 

(CARD  (IOTA  ? JX65  (POWER  CRUISER) 

(IN. PLACE  7JX65  INDIAN . OCEAN) )))) ) 

TIME  WORLD)  ) 

(AND  (IS-TERM 

#S (CONTEXT 

: OPERATOR  CARDINALITY 
: FREE-VARS  NIL 
: LOCAL -VARS  NIL 
: OPERATOR- VAR  7JX65 

: CLASS-EXP  ((IN. CLASS  ?JX65  CRUISER) 

(IN. PLACE  ? JX65  INDIAN. OCEAN) ) 
: STAT-VAR  NIL 
: CONSTRAINTS  NIL) 

? JX69) 

(RESPONSE  ? JX69) ) 
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4  Servers  and  Ser\  ices 


In  on  environment  with  multiple  under!)  mg  systems,  one  must  have  a  uniform  way  to  describe  the  capabilities 
of  each  underlying  system.  We  adopt  terminology  similar  to  that  of  [3 1  and  [4  j 

A  sener  is  a  functional  module  typically  corresponding  to  an  underlying  system  or  a  major  pan  of  an 
underlying  system.  In  the  application  of  the  MVS  system  being  descnbed  here,  there  are  two  servers  --  one  named 
:ERL.  which  supports  access  to  a  relational  database,  and  one  called  :LISP,  which  supports  calls  to  arbitrary  LISP 
functions.  Each  server  has  associated  with  it: 

LA  number  of  sesues:  objects  describing  a  particular  piece  of  functionality  provided  by  a  server. 
Specifying  a  ser.ice  in  MVS  provides  the  mapping  from  fragments  of  logical  form  to  fragments  of 
underlying  sysu  m  code 

2.  An  execution  planner  a  function  that  takes  a  piece  of  the  solution  to  a  query  (see  section  5 )  and  builds 
from  it  a  partial  execution  plan  (see  section  6) 

3.  An  executor:  a  function  that  takes  a  partial  execution  plan  together  with  input  data,  executes  the  plan, 
and  produces  output  data  (see  section  7 ). 

A  serv  ice  is  an  object  consisting  of  the  follow  ing  components: 

•  Name  a  symbol  used  to  uniquely  identify  the  service 

•  Owner:  the  name  of  the  server  to  which  this  service  belongs 

•  Cost,  a  scalar  value  indicating  the  cost  of  this  serv  ice:  if  unspecified,  unit  cost  ( 1  *  is  assumed. 

•  Inputs  a  list  of  pattern  variables,  each  of  which  has  associated  with  it  a  name,  a  type,  and  a  constraint. 

The  type  indicates  the  extent  to  which  the  input  is  optional  a  type  of  :GEH  indicates  that  input  to  this 
variable  is  optional,  since  this  service  can  generate  values  for  the  variable;  a  type  of  TEST  indicates 
that  input  must  be  provided  for  the  variable,  since  this  service  is  only  capable  of  applying  some  test  to 
the  input  values:  a  type  of  TEST-ALL  indicates  not  only  that  input  to  this  variable  is  obligatory,  but  that 
by  the  time  the  data  for  this  variable  reaches  this  service  it  must  be  filtered  as  completely  as  possible  -- 
this  is  often  the  type  for  inputs  to  services  that  do  response  presentation,  for  example 

The  constraint  associated  with  the  variable  is  used  for  pattern-matching.  The  possible  constraints 
include: 

1. 1  symbol*  y.  a  list  of  symbols.  Items  matching  this  variable  must  be  EQ  to  a  symbol  on  the  list. 

2.  (string*  y.  a  list  of  strings.  Items  matching  this  variable  must  be  STR1NG=  to  a  string  on  the 
list. 

3.  type :  a  simple  type.  An  item  will  match  this  variable  if  the  type  (i.e  .  type-system  type  —  see 
section  2)  of  the  item  is  a  subtype  of  type.  (The  type  of  a  Janus  variable  i  is  type  if  the  clause 
<l.\. CLASS  v  type)  appears  in  the  (normalized)  query.)  This  constraint  does  not  pay  attention  to 
whether  or  not  a  type  denotes  a  set  --  if  type  is  (S  SHIP)  (a  set  of  ships),  an  item  with  type  SHIP 
will  match,  and  vice-versa. 

4.  function:  a  function  than  takes  one  argument.  An  item  item  will  match  this  v  unable  li  if  uncoil 
function  item i  returns  a  non- NIL  value. 

5.  (SUBTYPE -OF  type):  a  subtype  specification  Items  matching  this  variable  must  themselves  V 
types  in  the  type  system:  furthermore,  they  must  be  subtypes  of  type.  For  example,  a  variable 
wtth  constraint  tSUBTYPE-OF  SHIP)  would  match  CRUISER  (since  CRUISER  is  a  subtype  of 
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SHIP).  Notice  how  this  differs  from  (3t.  above:  there  the  constraint  is  that 

(SUBTYPE  iTVPEOF  item)  npei  must  be  tme.  whereas  here  the  constraint  is  that 
(SUBTYPE  item  npei  must  be  true 

6.  ML.  ASYTYPE.  T  these  will  match  anything 

•  Outputs:  a  list  of  pattern  variables,  identifying  the  outputs  of  the  service.  Outputs  need  not  have  been 
inputs,  nor  must  inputs  to  the  service  also  be  outputs. 

•  Pattern:  a  pattern  specification  which  will  match  some  piece  of  the  logical  form  The  pattern 
specification  must  be  a  list,  each  element  of  which  is  the  partem  for  either  a  simple  clause  or  for  a 
context-clause.  Within  patterns,  one  can  not  have  a  variable  predicate:  however,  the  arguments  to 
predicates  must  be  pattern  variables  (see  inputs ,  above,  for  a  descnption  of  how  to  constrain  what  these 
variables  may  match). 

A  pattern  specification  for  a  context-clause  ( context-spec )  takes  one  of  two  forms. 

(:  context  :  operator  operator 

[  :£ree-vars  (ur*)  ] 

(  :  operator-vsr  var] 

[:stat-var  var] 

[:  class-exp  expression] 

[  :  constraint  s  expression]  ) 

For  this  form  of  partem  specification,  the  context-spec's  operator  must  match  the  context's  operator, 
and  recursive  calls  to  the  matcher  must  return  successfully  for  the  :class-exp  and  xonstraints. 

(:  context  :  operator  operator 

[  :  covers -owner  sener-name]  ) 

This  second  form  of  pattern  specification  allows  one  to  say.  "This  service  will  match  jm  context  whose 
operator  is  operator,  as  long  as  there  are  solutions  of  the  :class-exp  subexpression  and  of  the 
xonstraints  subexpression  such  that  both  solutions  belong  entirely  to  server  sener-name."  For 
example,  a  context-spec  for  operator  C.^RDINALETY  specifying  that  it  covers  owner  ERL  says,  in 
effect.  'This  serv  ice  can  take  the  cardinality  of  any  set.  as  long  as  that  set  can  be  obtained  entirely  by 
calls  within  the  :ERL  server."  This  is  a  useful  method  of  providing  general  services  that  handle 
aggregate  operations  within  a  single  server. 

•  Method:  a  code  fragment  or  other  information  that  the  server  will  use  in  generating  a  partial  execution 
plan  from  a  solution  that  utilizes  this  service.  WTiat  goes  in  the  method  slot  depends  enurely  on  the 
particular  server  to  which  the  service  belongs. 


For  fast  access,  services  are  indexed  by  the  predicates  in  their  partem.  That  is.  for  every  clause  iP  x  \  )  in  the 
pattern  of  some  service  S.  there  is  a  pointer  from  the  symbol  P  to  the  serv  ice  S.  An  exception  to  this  is  the  IS. CLASS 
predicate:  if  a  service's  pattern  includes  HN.CLASS  x  Cl,  the  pointer  will  be  from  the  symbol  C  rather  than  the 
symbol  IN. CLASS:  that  is.  the  indexing  proceeds  as  if  the  clause  were  C(xi. 


As  an  example,  consider  the  service-object  corresponding  to  the  ERL  server  s  ability  to  access  a  table 
associating  ships  with  overall  combat-readiness  values: 
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NAME  : 
OWNER: 
INPUTS : 
OUTPUTS : 
PATTERN: 
COST  : 
METHOD : 


VESSEL-OVERALL- READINESS -OF 8  5  9 
:  ERL 

(<x>  <y>) 

(<x>  <y>) 

(  (VESSEL-OVERALL-READINESS-OF  <x>  <y>) ) 
NIL5 

(((VESSEL-OVERALL-READINESS-OF  X  Y)  ) 
(BINDTOERL  ((X  IUID)  (Y  RDY) )  IID.RDY)) 


The  name  and  owner  fields  are  straightforward:  the  service  has  a  unique  name  and  belongs  to  the  server 
named  :ERL  (in  a  current  implementation.  ERL  is  the  server  that  can  access  the  Navy's  relational  database).  The 
pattern  is  also  particularly  simple,  a  single  clause  Note  that  the  vanables  printed  as  <x>  and  <y>  are  objects: 

NAME:  X 

TYPE:  : GEN 

CONSTRAINT:  VESSEL 


NAME :  Y 

TYPE:  : GEN 

CONSTRAINT:  READINESS-RATING 

Because  both  are  type  .GEN,  this  service  does  not  require  input  values  for  these  variables.  The  pattern  will  match 
clauses  only  when  the  type  of  the  first  argument  (matching  <x>)  is  VTESSEL.  and  the  type  of  the  second  argument 
(matching  <y>)  is  READINESS-RATING. 


The  method  field  for  services  belonging  to  the  :ERL  server  contains  two  pieces:  first,  the  pattern  that  was 
matched:  second,  a  code-like  fragment  that  relates  vanables  to  fields  and  specifies  a  table  from  which  to  draw  those 
fields  6 

The  scheme  for  indexing  services  establishes  a  pointer  from  the  symbol 
VESSEL-OVERALL-READINESS-OF  to  this  service 


5  Formulation 


The  job  of  the  formulation  algorithm  is  to  locate  all  services  that  might  be  resources  for  satisfying  a  request, 
and  find  the  best  possible  combination  of  services  from  that  set.  where  "best "  typically  means  lowest-cost.  This  is 
inherently  a  search  problem  7  Previous  approaches  to  the  formulation  problem  have  included  using  NIKL 


'The  cost  field  is  unspecified,  therefore  this  vers  ice  is  assumed  to  have  unit  cost. 

^This  is  a  simplification:  the  table  specification  may  be  a  fragment  of  ERL  code,  complete  with  JOINs.  SELECTs,  etc. 

The  formulation  problem.  when  us  input  is  a  conjunction  of  non-negated  simple  clauses,  can  be  sieved  as  a  kind  of  set -covering  problem 

(SCI5),  which  is  NP -complete  [IJ  The  SCPon  he  formulated  as  follows:  given  a  set  S  =  (\r  . s^j  and  a  collection  C  =  (Cr  C, . Cm) 

such  that  each  C  is  a  t propen  subset  of  S  and  each  C  has  a  positive  cost  c  find  the  subset  C‘  of  C  such  that  <  t  \  the  union  over  C  equals  S.  and 
<  Z  >  the  sum  of  the  costs  over  C  is  minimised  In  (he  formulation  problem.  S  is  the  set  ot  clauses,  jnd  each  element  of  C  is  a  serv  ice . 
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classification  [4]  and  a  kind  of  A*  search  [5]  The  approach  here  resembles  a  beam  search,  and  uses  a  greedy 
heunsfic. 

The  first  two  subsections  describe  two  objects,  partial  solutions  and  marches,  that  are  important  at  the' 
implementation  level:  the  third  section  describes  the  formulation  algorithm 

5.1  Partial  Solutions 

The  main  structure  used  in  the  formulation  stage  is  the  partial  solution,  an  object  used  in  building  up  a 
collection  of  services  for  a  given  input  expression.  Initial  partial  solutions  are  created  from  individual  services: 
otherwise  partial  solutions  are  created  by  combining  other  partial  solutions.  Each  of  these  obiects  has  the  follow  ing 
components. 

•  Expr:  clauses  from  the  logical  form  that  this  solution  does  nor  cover  In  the  empty  solution  th/s  field  is 
equal  to  the  input  expression:  in  a  complete  solution  this  field  is  empty. 

•  Cost:  the  combined  cost  of  all  the  component  solutions  making  up  this  solution. 

•  Input-links:  the  mapping  from  variables  in  the  logical  expression  to  variables  in  the  solution's 
servicets). 

•  Output-links:  the  mapping  from  variables  in  the  solution's  service(s)  to  variables  in  the  logical 
expression 

•  Internal-partials:  the  collection  of  initial  (sometimes  also  called  pnmitne >  panial  solutions  from 
which  this  panial  solution  was  constructed. 

•  Matches  the  collection  of  match  objects  belonging  to  this  panial  solution.  There  is  one  match  object 
for  each  clause  matched  by  a  service.  See  section  5.2. 

•  Local-matches:  this  field  is  currently  unused,  but  is  intended  for  use  when  optional  system -dependent 
rewrites  are  introduced. 

•  Goodness:  value  based  upon  the  sum  of  the  cost  of  component  solutions  and  other  factors  (e.g. 
communication  cost  between  component  solutions) 

5.2  Matches 

A  match  is  an  object  built  during  pattern-matching  Matches  are  also  used  dunng  later  stages  of  processing, 
since  they  provide  the  link  between  a  partial  solution  and  its  component  services.  Each  match  object  compnsex: 

•  the  serv  ice,  part  of  whose  pattern  was  matched 

•  a  "name",  consisting  of  a  list  of  the  service  and  an  instance  number,  used  to  distinguish  different 
instances  of  the  same  service  (for  example,  when  one  service  matches  two  different  parts  of  the  same 
pattern  —  see  page  19 1 

•  the  clause  in  the  input  expression  that  was  matched 

•  the  clause  in  the  service's  pattern  that  did  the  matching 

•  the  variable  mappings  produced  by  the  match  (see  also  input-links  and  output-links,  above) 
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•  embedded  partial  solutions  created  via  recursive  call's)  to  the  formulation  algonthm.  applicable  onh 
when  the  matched  clause  is  a  context,  and  therefore  has  internal  subexpressions. 

•  embedded  variable  mappings  associated  with  the1  embedded  partial  solutions 


For  example,  consider  a  service  named  X.WfEOFJJX.  whose  pattern  is  ((NAMEOF  <x>  <yx >.  and  an  input 
expression  containing  the  clause  iX.WfEOE  ,'JXI  "l  fSCFX'XES  'i.  The  match  object  created  by  the  pattern  matcher 
will  look  like: 


SERVICE : 

VAR -MAPPINGS : 
PATTERN : 

CLAUSE : 

EMBEDDED-SOLUTIONS : 
EMBEDDED -MAP PINGS : 


[Service:  NAMEOF 328 ] 
(("VINCENNES”  .  Y)  (?JX1  .  X)  ) 
( (NAMEOF  <x>  <y>)  ) 

(NAMEOF  ?JX1  "VINCENNES") 

NIL 

NIL 


As  a  more  complex  example,  consider  a  service  named  GEXERAL-CARDIXAUTY .  belonging  to  the  LISP 
server,  that  matches  cardinal it>  expressions.  The  query  '  How  many  Cl  cruisers  are  there.’"  produces  the  request 

(AND  (IS-TERM 

#S (CONTEXT 

: OPERATOR  CARDINALITY 
: FREE -VARS  NIL 
: LOCAL -VARS  NIL 
: OPERATOR-VAR  ?JX1 

: CLASS-EXP  ( (IN . CLASS  ?JX1  CRUISER) 

(VESSEL-OVERALL-READINESS-OF  ?JX1  Cl)) 

: STAT-VAR  NIL 
.•CONSTRAINTS  NIL) 

?  JX5) 

(RESPONSE  ? JX5 ) ) 


The  match  object  created  b\  matching  the  GEXER.\L-C.-\RDIX.\Lm'  service  to  the  first  clause  is 
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SERVICE:  [Service:  GENERAL-CARD INALITY] 

VAR-MAPP INGS :  ( (<C . GENERAL-CARDINALITY . 5>  .  ?JX5) ) 

PATTERN:  ([Context-spec:  CARDINALITY] ) 

CLAUSE  : 

(IS-TERM 

#S (CONTEXT 

: OPERATOR  CARDINALITY 
: FREE-VARS  NIL 
: LOCAL -VARS  NIL 
: OPERATOR-VAR  ?JX1 
: CLASS -EXP 

((IN. CLASS  ?JX1  CRUISER) 

(VESSEL-OVERALL-READINESS-OE  ?JX1  Cl)) 

: STAT-VAR  NIL 
; CONSTRAINTS  NIL) 

?  JX5 ) 

EMBEDDED-SOLUTIONS:  ((PS:  9.0  ]  NIL) 

EMBEDDED-MAPPINGS : 

(((CRUISER  .  <w . CRUISER23 . 2>) 

(?JX1  .  <x . CRUISER23 . 2>) 

(? JXb  .  <y .NAMEOF328 . 3>) 

(?JX1  .  <x . NAMEOF  328 . 3  > ) 

(Cl  .  <y . VESSEL -OVERALL -RE AD INESS -OF859 . 1>) 

(?JX1  .  <x . VESSEL-OVERALL-READ INESS -OF85 9 . 1>) 

((? JX6)  .  <X. VALUE-EXISTS-RESPONSE . 4>) ) 

NIL 

((?JX1  .  <X.GENERAL-CARDINALITY>) ) ) 

The  expression  embedded  in  the  cardinality  clause  'c!ass-c\p\  was  solved  recursively  in  the  course  of  the  matching  ' 
The  solution,  which  retrieves  all  the  Cl  cruisers,  is  the  first  element  in  the  EMBEDDED- SOLUTIONS  field,  above 
The  second  element  in  EMBEDDED-SOLUTIONS  is  NIL  because  there  was  no  constraint  expression  --  had  this 
context  been.  say.  a  universal  quantification  < FOR.XLL  car  ctass-exp  constraints,,  then  thix  second  element  would 
have  been  the  solution  to  the  constraints  expression 


5.3  The  Formulation  Algorithm 


The  formulation  algorithm  is  shown  below. 


£mNr<id<rJ  snluihrfjs  j re  compute  J  onlv  oik c.  of 


t'ur^c.  regardless  of  h***  much  pattern -nutc hinw  «»n 
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Find  the  set  P  of  imna! partial  solutions  for  request  expression. 

Choose  the  "best"  n  elements  px,  .  . . ,pn  in  P,  and 
let  focus  set  F  =  (px , . . . , pn } 

While  no  member  of  F  is  a  complete  solution 

Choose  the  n  best  elements  . fn  of  F 

For  each  f^ 

"Age"  f x ,  reducing  its  goodness  by  some  factor 

Choose  the  best  element  p  from  P  that  can  be  combined  with  ^ 
Let  £  '  =  combine-partials (fl,  p) 

Add  fi'  to  F 

If  no  member  of  F  was  expanded, 

Report  that  no  solution  was  found 

Else 

Let  F  =  the  best  n  elements  of  F,  plus  complete  solutions  in  F 


Each  minci!  partial  solution  is  essentially  an  instance  ol  a  service  whose  partem  has  been  completely  matched 
by  part  of  the  expression  These  are  found  by  first  restricting  the  search  to  those  services  matching  some  predicate 
appearing  in  the  pattern  (recall  the  discussion  of  how  services  are  indexed,  in  section  Jf.  then  doing  more  complete 
pattern-matching  on  the  restricted  set  The  set  of  initial  partial  solutions  includes  all  possible  ways  to  match  a 
service  to  a  pattern  For  example,  given  an  expression 

(AND  (P  ?JX1)  .clause  (1) 

(P  ?JX2)  .clause  (2) 

(Q  ? JX3 ) )  .clause  (3) 

and  a  serv  ice  vv  ith  the  pattern 
(AND  (P  <x>) 

<Q  <Y»  ) 

there  w  ill  be  nw  miti.il  partial  solutions  produced,  one  in  which  the  serv  ice  h.is  matched  clauses  <  I  i  and  (2 1.  and  the 
other  in  whreh  the  sere  ice  has  matched  clauses  1 1  i  and  1 3 1 

Beginning  with  a  focus  set  of  initial  partial  solutions,  the  formulation  algonthm  seeks  to  expand  solutions  in 
the  focus  set  Aging  elements  in  the  locus  set  --  that  is.  reducing  their  goodness  by  some  small  factor  each 
iteration  --  results  in  throwing  out  nonproductive  solutions  after  a  while  This  algonthm  is  not  complete  it  is 
possible  that  no  member  of  the  initial  locus  set  will  provide  a  successful  starting  point,  one  might  consider  adding  a 
step  in  the  m  lii'f  loop  that  allows  new  starting  points  to  be  added  to  the  focus  set  Nor  does  the  greedy  heunstic  used 
guarantee  finding  the  optimal  solution 


Partial  solutions  cun  only  be  combined  if  there  is  a  connection  between  them  one  vanable  provided  as  output 
by  one  solution  must  be  desired  as  input  by  the  other  The  combination  does  not.  however,  cement  these 
input  output  links,  establishing  such  data  dependencies  is  the  job  of  the  execution  planner  i  see  section  b).  Alter 
combining  two  partial  solutions,  the  resulting  partial  solution's  input  is  the  union  of  the  component  solutions’  input.-, 
and  the  outputs  include  any  output  provided  by  a  component  partial  solution 
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5.4  Formulation  and  Embedded  Contexts 

Notice  that  the  formulation  algorithm  does  not  include  recursive  calls  to  itself  Calls  to  the  formulation 
algorithm  for  handling  embedded  expressions  tie  ,  the  class  expression  or  constraint  expression  of  a  context  i  are 
made  as  part  of  the  pattern-matching  process  (see  section  5  2).  It  is  necessary  to  find  all  the  possible  ways  that  the 
embedded  expressions  can  be  solved,  before  doing  panem-maching  at  the  top  level  When  seeking  services  that  will 
cover  a  context  clause,  the  pattern  matcher 

l.  Finds  the  set  S  of  serv  ices  whose  pattern  includes  some  coi.iext-specification  whose  operator  matches 
this  context's  operator  (eg.  CARD,  FOR  ALL) 

2  Finds  a  collection11  C,  of  solutions  to  the  context's  class  expression  by  recursively  calling  the 
formulation  algorithm  on  it 

3.  Finds  a  collection  C-,  of  solutions  to  the  context's  constraint  expression,  in  the  same  manner 

4  Takes  the  cross-product  of  C|  and  C to  form  a  set  C'  of  embedded  (or  internal i  solutions  tor  the 
context  clause  Each  embedded  solution  represents  one  possible  way  to  solve  the  embedded 
expressions  10 

5.  For  each  service  in  S  and  for  each  embedded  solution  in  C.  checks  whether  S  matches  the  context 
clause  assuming  that  particular  embedded  solution 

Some  services  in  S  make  use  of  the  embedded  solution  information  in  matching  elements  in  this  set  of 
pattern  clauses  (see  the  discussion  of  the  pattern  field  in  services,  specifically  the  use  of  the 
:COVERS-OWNER  parameter,  in  section  4).  Other  services  do  not  use  the  embedded  solution 
information  for  pattern-matching  In  either  case,  of  course,  the  embedded  solution's)  will  be  used  for 
execution  planning 


i 

6  Execution  Planning 


The  job  of  the  execution-planning  phase  is  to  take  a  complete  solution  produced  by  the  formulation  algorithm 
and  prixiuce  an  execution  plan  that  makes  use  of  the  resources  specified  by  the  solution  and  provides  a  specification 
of  the  dataflow  among  those  resources 


Recall  that  the  formulation  algorithm  produces  n  solutions 

°An  exception  to  this  is  the  Ji.  ’un,  u.'n  .  onteii  in  which  the  class  expression  field  domains  an  arbitrarily  long  list  of  disjuncts.  each  of  which 
must  be  solved  recursive!)  It  each  disjunct  can  be  solved  in  several  it  is  impractical  to  take  the  c  ross  pr>>duv  t.  instejd.  we  create  <>nl\  one 

embedded  solution  usinii  the  be>t  solution  for  eac  h  J.sjynv  t 
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6.1  Partial  Kvecution  Plans 


An  execuuon  plan  is  a  sequence  of  partial  execution  plans,  each  of  whuh  consists  of  the  following 

•  Owner  the  name  of  the  server  to  which  this  partial  plan  belongs,  and  which  will  execute  rh.-;  n.iriaaJ 
plan 

•  Inputs  the  inputs  as  identified  by  the  Janus  variables  (prefixed  by  "  'JX  '  i  in  the  request  that  this  partial 
plan  expects  to  receive 

•  Outputs  the  outputs  (also  identified  by  Janus  variables)  that  this  panial  plan  will  produce 

•  W  rapper  a  place  to  hold  the  data  passed  to  this  partial  plan  by  other  parti.il  plans  The  w  rapper  is  an 
object  that  includes. 

•  A  list  of  streams  --  pointers  back  to  where  the  data  came  from 

•  For  each  stream,  a  list  of  labels  --  each  label  is  a  Janus  variable  front  the  request 

•  For  each  stream,  a  bucket  containing  the  actual  tuples  passed  in  as  data 

For  example,  the  wrapper  for  a  partial  execution  plan  (after  a  previous  partial  plan  has  been  executed) 
might  look  like  this 

#S (WRAPPER  : STREAMS  (#S (PARTIAL-EXECUTION-PLAN ...) ) 

: LABELS  ((?JX1  ?JX2)) 

(BUCKETS  ((("VINCENNES''  1) 

("NIMITZ"  2) 

("FREDERICK"  1)  .  .  .  )  )  ) 

This  wrapper  has  only  received  input  from  one  place:  the  pariiu)  plan  on  the  STREAMS  list  The  data 
are  tn  the  form  of  pairs  -  the  first  element  in  each  pair  is  an  instantiation  of  the  Janus  vanable  ,’JXl  in 
the  request,  and  the  second  element  in  each  pair  is  an  instantiation  of  the  vanable  .’JX2. 

•  Body  the  code  to  be  executed  by  the  server  (i.e  .  by  the  server's  executor  function).  This  code  is  built 
by  the  serv  er's  execution  planner  function 


6.2  Partitionin';  and  Kxtahlixhino  Dependencies 

There  arc  two  processes  in  creating  an  execution  plan  from  a  complete  solution:  parntiorans  the  solution 
according  to  server,  and  setting  up  the  dependencies  among  the  nodes  in  the  partition,  based  upon  the  possible  inputs 
and  output-,  of  each  node  There  is  a  certain  circularity  here  that  makes  the  process  difficult:  one  can  not  set  up 
dependencies  until  the  solution  is  partitioned  according  to  server,  yet  in  order  to  partition  properly  (for  example,  to 
split  a  node  belonging  to  a  particular  server  into  two  nodes  in  order  to  allow  another  node  to  fit  between  them)  one 
needs  to  know  what  the  dependencies  are  The  current  implementation  does  not  handle  this  issue  particularly  well 
it  partitions  first,  then  attempts  to  set  up  dependencies. 

Assuming  that  partitioning  has  been  reasonably  done,  there  is  another  problem  of  circularity,  involving  the 
expected  inputs  and  outputs  of  each  node:  one  would  like  to  be  able  to  operate  from  a  global  perspective,  using  the 
expected  inputs  and  outputs  to  optimally  plan  dataflow  links:  on  the  other  hand,  at  the  lev"'  of  each  node,  one  w  ould 
like  the  global  planner  to  provide  the  desired  inputs  and  outputs,  so  as  to  produce  optimal  code  for  this  node  of  the 
execution  plan  In  essence,  the  global  level  says,  tell  me  what  to  expect."  and  the  local  level  says  "tell  me  what  you 
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need'"  The  current  implementation  is  limited  as  follows:  planning  at  both  levels  assumes  that  all  inputs  are 
provided,  and  all  outputs  are  required. 


6.3  Servers"  Execution  Planners 


As  discussed  in  section  4,  every  server  has  associated  with  it  an  execution  planner  function.  Execution 
planner  functions  take  three  arguments. 

1.  A  list  of  lanus  variables  it  can  expect  to  have  values  for  as  input, 

2.  A  list  of  Janus  variables  it  should  produce  as  output,  and 

3.  The  solution-object  for  the  portion  of  the  query  that  is  to  be  handled  From  the  solution  object,  one 
can  obtain  the  component  services  (and  thus  their  method  slots,  which  contain  the  necessary  code 
fragments)  and  variable  mappings  from  Janus  variables  to  service  variables. 

These  functions  should  return: 

1.  Code  to  be  executed  by  the  execution  function  (see  section  7), 

2.  Reductions  from  tuples  to  single  variables,  if  any  (e  g  .  if  the  execution-planner  determines  that  output 
variable  ?JX3  should  be  treated  as  (tuple  '.'JX1  ,'JX2i  for  information-passing,  then  the  list 
(97XJ  .'IX t  ‘IX 2 1  is  one  such  reduction),  and 

3.  The  Janus  vanables  that  the  code  will  produce  output  for 


7  Execution 


The  execution  phase  takes  an  execution  plan  ti  e.,  a  list  of  partial  execution  plans i,  and  iterates  through  it 
sequentially 

for  each  partial  execution  plan  p 

Combine  the  data  from  the  streams  in  the  wrapper  of  p 
Call  the  execution  function  for  the  owner  of  p 
Pass  the  output  tuples  (according  to  the  dataflow  links  of  p) 
into  the  wrapper  objects  of  partial  plans  further  on 
Return  the  output  provided  by  the  last  partial  execution  plan 

7.1  Combining  Data 

Previous  approaches  to  the  multiple  systems  problem  (e  g  .  [4]  and  [3])  have  assumed,  for  the  purpose  of 
execution,  a  straightforward  dataflow  model  in  which  nodes  accomplish  execution  and  arcs  are  streams  of  values. 
Unfortunately,  the  problem  of  passing  and  combining  data  among  multiple  systems  is  more  complex  than  tins  model 
will  accommodate.  In  most  cases,  it  is  necessary  to  pass  sets  of  tuples  rather  than  sets  of  values,  using  a 
generalization  of  the  join  operation  to  combine  data.  There  are  problems  that  even  this  does  not  address. 
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7.1.1  Passing  tuples,  not  salues 

Consider  a  scenario  m  which  the  user  has  requested  a  table  of  the  speeds  of  the  ships  in  the  Indian  Ocean 
that  are  faster  than  20  knots  ',  and  in  which  the  resulting  solution  involves  four  services: 
l.  Generate  generates  pairs  of  ships  and  speeds. 

2  Filter-location  filters  a  list  of  ships  according  to  whether  or  not  the^  are  in  a  given  location. 

3  Filter-speed  filters  a  list  of  numerical  speed  values  according  to  whether  or  not  thev  are  faster  than  a 
given  speed,  and 

4.  Present  presents  a  table  of  ships  and  speeds. 

In  a  model  in  which  streams  of  values  are  passed,  generate  will  pass  a  stream  of  ships  to  filter-location,  w  hich  w  ill 
pass  a  filtered  stream  of  ships  to  present :  generate  will  also  pass  a  stream  of  speeds  to  filter-speed,  which  will  pass  a 
filtered  stream  of  speeds  to  present. 


GENERATE 


<SHIP,  SP£ED> 


The  problem  is  this  present  has  received  both  ships  and  speeds,  but  how  can  it  now  decide  which  speeds 
belong  to  which  ships0  The  relation  of  ships  to  speeds  was  lost  because,  although  pairs  were  generated  by  generate. 
they  were  split  up  in  order  to  pass  the  data,  once  split  up.  there  is  no  way  to  put  them  back  together  again 

An  obvious  approach  to  solving  this  problem  is  to  pass  not  streams  of  values,  but  streams  of  tuples  of  values, 
never  breaking  up  a  tuple.  This  increases  the  volume  of  data  passed,  of  course,  but  it  does  ensure  that  the 
appropriate  relationships  are  maintained 

Now  suppose  that  the  same  arcs  represent  the  passing  of  tuples  rather  than  of  individual  values.  Generate 
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generates  pairs  <ship.  speed>  of  ships  and  speed  v  alues.  Because  filter-location  requires  the  ships,  generate  passes 
all  the  pairs  to  fitter-location,  which  filters  out  those  tuples  in  which  the  ship  is  not  at  the  appropriate  location 
Filter-location  then  passes  the  filtered  set  of  tuples  to  present.  Because  filter-speed  requires  the  speed  values. 
generate  passes  all  the  pairs  to  filter- speed,  which  filters  out  those  tuples  in  which  the  speed  is  too  slow 
Filter-speed  then  passes  the  filtered  set  of  tuples  to  present. 


GENERATE 


<SH!P,  SPEED> 


The  situation  has  improved,  in  that  we  have  maintained  the  association  between  ships  and  speeds.  However. 
present  has  now  received  two  different  sets  of  ship-speed  pairs,  one  filtered  according  to  a  property  of  the  ship,  and 
the  other  according  to  a  property  of  the  speed.  How  do  we  combine  them'1 

7.1.2  Join  and  Cross-join 

A  solution  that  works  in  many  cases  is  a  database  join  across  the  attributes  that  the  streams  have  in  common. 1 1 
A  join  effectively  takes  the  cross-product  of  the  incoming  sea  ot  tuples,  and  then  removes  from  the  cross-product 
any  tuples  in  which  the  values  of  the  common  attributes  are  not  equal.  For  example,  the  join  of  a  set  of  tuples 
<ship.  location>  with  a  set  of  tuples  <ship,  speed>  across  the  attribute  ship  will  result  in  a  set  of  tuples 
< ship,  location,  speed>.  which  w  ill  include  only  values  of  ship  that  appeared  in  both  sets  of  tuples  being  joined. 

It  is  quite  possible  that  two  (or  more)  streams  of  input  will  have  no  attributes  in  common,  and  in  such  cases 
join  can  not  be  used.  In  such  a  situation,  one  would  like  to  use  a  version  of  the  join  operator  that  computes  the  cross 
product,  but  -  because  zero  attributes  are  held  in  common  --  does  not  attempt  to  do  the  filtering  operation. 

"Li  this  case,  both  attributes  --  ship  and  speed-'  alue  --  are  in  common,  so  the  join  is  just  the  intersection  of  the  rsvo  sets  of  tuples 
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For  this  reason,  we  use  a  combining  operator  called  cross-join :  when  the  incoming  streams  have  attnbutes  in 
common,  cross-join  is  equivalent  to  join:  when  there  are  no  common  attnbutes.  cross-join  is  equivalent  to  the 
cross-product. 


7.1.3  Problt'ms  with  cross-join 


In  some  cases,  the  strategy  adopted  by  cross-join  is  not  appropriate.  For  example,  suppose  the  user  has 
requested  the  commanders  and  destinations  of  all  the  ships,  and  that  one  server  generates  pairs  of 
<ship,  commanded  while  another  generates  <ship.  Jestinati.in>.  Suppose,  further,  that  each  of  these  servers  has 
only  incomplete  information  so  th  ‘t  each  produces  tuples  about  some  ships  not  known  by  the  other. 

SHIP  (?jxl)  COMMANDER  (?jx2)  SHIP  (?jxl)  PEST  (?jx3) 


VINCENT 

rox 

FREDERICK 


SMITH 

JONES 

BROWN 


VINCENT 

NIMITZ 

FREDERICK 


HAWAII 
HAWAII 
SAN  DIEGO 


In  such  a  case,  the  cross-join  operation  will  recognize  that  the  ship  attnbute  is  held  in  common  by  the 
incoming  sets  of  tuples,  and  thus  combine  the  sets  of  tuples  using  join : 

SHIP  (?jxl)  COMMANDER  C?jx2)  PEST  (?jx3 ) 

VINCENT  SMITH  HAWAII 

FREDERICK  BROWN  SAN  DIEGO 

Notice  that  as  a  result  of  join's  filtering  operation.  FOX  and  SlMri  Z  do  not  appear  in  the  output  data  Considering 
the  fact  that  the  user  requested  the  commanders  and  destinations  of  oil  the  ships,  this  is  an  undesirable  result:  it  is 
likely  that  the  user  wants  to  see  whatever  information  is  available  about  each  ship  even  if  that  information  is 
incomplete. 

This  example  serves  to  illustrate  that  cross-join  is  not  appropriate  in  all  instances.  Unfortunately  ,  there  is 
currently  no  easy  way  to  identify  such  cases  For  the  present  time,  cross-join  is  always  used  to  combine  data 


7.2  Ser\erV  Execution  f  unctions 

The  execution  function  (or  executor  i  for  a  server  is  a  function  taking  the  following  arguments. 
1  A  list  of  tuples  representing  input  values. 

2.  A  sequence  of  Janus  variables  identify  ing  the  tuple  elements,  and 

3.  Code  produced  by  the  execution  planner 

The  execution  function  should  return  two  values: 

1.  A  list  cf  tuples  representing  output  values,  and 

2.  A  sequence  of  Janus  variables  identifying  the  tuple-elements. 
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8  Status  and  Extensions 


8.1  Experience 

The  ML'S  component  described  in  this  documentation  has  been  successfully  implemented  and  used  in  tlx* 
domain  of  the  Fleet  Command  Center  Battle  Management  Program  (FCCBSfP),  using  an  internal  version  of  the 
Integrated  Database  (IDB)  -  a  relational  database  —  as  one  underlying  resource,  and  a  set  of  LISP  functions  as 

another.  The  system  includes  more  than  800  services,  and  produces  an  execution  plan  for  a  typical  request  in 

seconds  or  fractions  of  seconds,  it  also  reports  failure  to  create  an  execution  plan  within  seconds  (for  example,  in 
cases  where  no  service  exists  covering  part  of  a  request  expression).  Queries  handled  include  those  involving 
negation  of  simple  predicates,  existential  and  universal  quantification,  cardinality,  and  the  most  common 
disjunctions.1-  as  well  as  queries  that  are  simply  conjunctions  of  clauses.  Both  queries  requesting  values  (actually, 
tuples  of  values  are  returned)  and  yes/no  queries  are  handled. 

An  earlier  version  of  the  system  desenbed  here  was  successfully  used  within  an  expert  system  project,  in 

which  Janus  provided  natural  language  access  to  data  in  Intellicorps  KEE  knowledge-base  system  to  objects 

representing  hypothetical  world-states  m  a  simulation  system  and  to  LISP  functions  capable  of  manipulating  this 
data. 


8.2  Limitalions  and  Extensions 


There  are  several  limitations  and  possible  extensions  in  the  current  implementation  of  the  system: 

1.  Although  underlying-system-iWt'ptvii/iw  rewrite  rules  are  supported,  underiying-system-c/t'peniienr 
rewrites  are  not.  In  order  to  allow  these,  it  is  necessary  to  modify  the  formulation  algorithm  so  that, 
rather  than  expanding  an  intermediate  partial  solution,  one  can  modify  it  by  applying  a  system- 
dependent  rewrite  One  would  need  to  avoid  combining  partial  solutions  evolved  from  different 
request  expressions,  as  could  be  the  case  if  different  rewrites  have  applied 

2.  The  formulation  algonthm  is  not  complete:  it  is  possible  that  none  of  t^'  imtiul  parual  solutions 
chosen  as  starting  points  can  be  expanded  into  a  solution  that  covers  the  entire  request.  One  might 
consider  using  a  different  search  algorithm  (e  g.,  a  modification  of  the  A*  search  employed  by  [3])  or 
modifying  the  algorithm  to  make  it  complete 

3.  The  model  of  service  costs,  assuming  a  single  scalar  cost  value,  is  too  simple  for  many  likely  real- 
world  situations.  A  better  model  would  distinguish  aspects  of  cost  like  the  reliability  of  the  service's 
data,  the  cost  of  communicating  with  it.  and  the  service's  time  and  space  requirements. 

4.  The  current  execution  model  assumes  that  the  representations  of  entities  are  the  same  in  different 
underlying  sy  stems.  This  is  a  severe  limitation  and  should  be  addressed  as  soon  as  possible  There  are 
three  immediately  apparent  cases: 

•  Spelling  variations.  For  example,  one  system  may  store  the  name  of  a  ship  as  the  "CARL 
VINSON"  while  another  stores  it  as  "VINSON  C1 ", 


1  2Tho'.e  cvprcsMnn  member, hip  in  .1  -ct.  e.2  .  iOR  t  item!  ,  r£(J  *  uemC*  i. 
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•  Different  internal  representations.  For  example,  one  system  may  store  "the  ship  itself  as  an  ID 
number  te  g  ”0123  )  while  another  uses  the  ship's  name  (e  g.  "VINSON  C  >. 

•  Differing  data  decompositions.  For  example,  one  system  mas  store  dates  as  single  values  <e  g 
date=' 07-13-65"  i  while  another  stores  them  as  several  values  te  g  nionth=07.  day- 1 3. 
year=65 ». 

5.  Error  recovery.  A  single  plan  is  created  in  response  to  a  request,  arid  if  it  fails,  tlie  system  has  no 
recourse  hut  to  report  an  error.  It  should  be  possible  to  modify  the  formulation  and  execution-planning 
phases  of  processing  to  allow  the  creation  of  alternative  plans.  Notice  that  this  still  does  not  address 
the  reasons  for  plan  failure:  if  a  disk  error  has  been  encountered  while  making  use  of  a  resource,  it 
makes  no  sease  to  try  again  with  a  different  plan  that  requires  the  same  resource. 
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